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^ ■ Abstract 



Unitary space-time modulation is known to be an efficient means to communicate over non-coherent 
Multiple Input Multiple Output (MIMO) channels. In this letter, differential unitary space-time coding 
and non-coherent space-time coding for the training based approach of Kim and Tarokh are addressed. 
For this approach, necessary and sufficient conditions for multi-group decodability are derived in a 
t/3 , simple way assuming a Generalized Likelihood Ratio Test receiver and a unitary codebook. Extending 

Kim and Tarokh's approach for colocated MIMO systems, a novel training based approach to distributed 
non-coherent space-time coding for wireless relay networks is proposed. An explicit construction of two- 

> 

' group decodable distributed non-coherent space-time codes achieving full cooperative diversity for all 

, even number of relays is provided. 

(N 

^ ■ Index Terms 

^p. ' Cooperative diversity, distributed space-time codes, non-coherent MIMO, training. 
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H I. Introduction 



One of the efficient means to communicate over non-coherent MIMO (Multiple Input Multiple 
Output) channels is the training based non-coherent orthogonal designs approach of Kim and 
Tarokh [1] which offers simple encoding, single complex symbol decoding along with full 
diversity. In this work, we generalize Kim and Tarokh's approach which result in multi-group 
decodable non-coherent space-time codes. Recently the authors of [2], [3], [4] have proposed 
distributed differential space-time coding for wireless relay networks wherein all the terminals 
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operate without the knowledge of any of the fading coefficients and yet achieve full cooperative 
diversity equal to the number of relays. However, the coding strategies proposed in [2], [3], [4] 
put extra stringent conditions (as compared to the colocated MIMO case) on the unitary matrix 
codebook such as the existence of matrices that commute with all the codewords. This makes 
code constructions particularly difficult. For example, the distributed differential space-time code 
constructions in [3], [4] force all the codeword matrices to commute with each other. 
The contributions of this letter can be summarized as follows. 

• Generalization of the non-coherent orthogonal designs based construction of Kim and Tarokh 
[1] by utilizing arbitrary linear designs instead of orthogonal designs alone. We refer to 
the resulting codes as training based non-coherent space-time codes. It is shown that by 
employing any full diversity coherent space-time code, a full diversity non-coherent space- 
time code can also be obtained. 

• For the training based non-coherent space-time codes, necessary and sufficient conditions 
for multi-group decodability are derived in a simple and elegant manner assuming a Gen- 
eralized Likelihood Ratio Test (GLRT) receiver and a unitary codebook. Moreover, the low 
complexity decoder for this case is described in a simple way. 

• Extending ideas from training based non-coherent space-time codes, a novel training based 
approach to distributed non-coherent space-time coding for wireless relay networks is pro- 
posed. This approach does not demand stringent conditions on the structure of the distributed 
space-time code such as commuting codewords which is the case for distributed differential 
space-time codes [2], [3], [4]. Moreover, the channel coherence interval required for this 
strategy (3R + 1 channel uses) is lesser compared to that required for distributed differential 
space-time coding [2], [3], [4] (4i? channel uses). An explicit construction of two group 
decodable codes achieving full cooperative diversity for all even number of relays is also 
provided. 

The rest of the letter is organized as follows. In Section HIl we generalize the training based 
non-coherent space-time coding approach proposed in [1]. A novel training based approach to 
distributed non-coherent space-time coding for wireless relay networks is proposed in Section 
mil and an explicit construction of 2-group decodable codes achieving full cooperative diversity 
is also provided. Simulation results comprise Section |lVl 
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II. Multi-group decodable Training based Non-coherent Space-Time Coding 

In this section, we generalize the training based non-coherent orthogonal designs approach 
in [1] by using arbitrary linear designs which are multi-group decodable instead of orthogonal 
designs alone. 

Let the number of transmit and receive antennas be denoted by n and m respectively. 

Definition 1: [5] A linear design D(xi,X2, ■ ■ ■ ,xk) in K real indeterminates or variables 
Xi, a:2, . . . , Xft- is a n X n matrix with entries being a complex linear combination of the variables. 
It can be written as D{xi,X2, ■ ■ ■ ,xk) = J2f=i^i^i where, Ai G C"^" are called the 'weight 
matrices'. A linear STBC is a finite set of nx n complex matrices which can be obtained by 
taking a linear design D{xi,X2, . . . ,xk) and specifying a signal set C from which the 

1 T 



information vector X 



Xi X2 ... Xk 



take values from, with the additional condition 



that D{a) ^ D{a'),\/ a ^ a' G A linear STBC = {D{X)\X e ^} is said to be g- 
^ real symbol encodable or ^ 



group encodable (or — real symbol encodable or complex symbol encodable) if g divides 



K and if = £/i x J2/2 x ■ ■ ■ x s/g where each s^i, i = 1, . . . ,g C M 9 . An orthogo- 
nal design 6(xi, X2, . . . , xk) in K real variables xi,X2, ■ ■ ■ ,xk is a linear design satisfying 

6(Xi, X2, . . . , Xk)^Q{Xi, X2, . . . , Xk) = {j2iLl ^tj In- 



, where 



The proposed transmission matrix S* in [1] has the form of S 

Q{xi,X2, . . .,Xk) 

6(xi,X2, . . . ,xk) is an orthogonal design in K real variables and of size n x n. The portion 
corresponding to /„ can be viewed as though pilots were sent from each of the transmit antennas. 
Hence this approach is referred to as training based. The authors of [1] then propose to pair two 
real variables at a time to form -|- complex variables and allow them to take values from a PSK 
constellation with appropriate number of points depending on the transmission rate that lead to 
single complex symbol decoding. 

We propose to generalize this approach by replacing the orthogonal design 6(a;i, X2, • • • , xk) 
by an arbitrary linear design D{xi, X2, ■ ■ ■ , xk)- By doing so, we can still reap the benefits of low 
encoding complexity because a linear design has been utilized. To guarantee full diversity, we 
need to construct non-intersecting subspaces [1], [7], i.e., the subspaces spanned by the columns 
of any two codewords should intersect trivially. To be precise, if Si and S2 are two different 



codeword matrices then the matrix 



Si 5*2 



should have full rank equal to 2n. 
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The codewords in our case are given by Si 
where Ci, C2, . 

tion [B For full diversity, we need the rank of 



In 


, ^2 = 


In 










C2 



Cl are elements of a linear STBC ^ 

In In 
Ci Cj 



Cr 



rank 







, . . . , Sl - 
{D{X)\Xe£^} as in"Defini 
to equal 2n, which is same as 



n + rank (Cj — Cj). Thus if the matrix Ci — Cj has rank of n for 

Cj Cj Cj 

all Cj 7^ Cj G full diversity is guaranteed. Thus, by simply vertically augmenting /„ with 
a fully diverse linear STBC ^, we get a fully diverse training based non-coherent space-time 
code. 

Though, it is not necessary for the elements of to be unitary matrices for achieving full 
diversity, we assume that they are unitary in the sequel in order to simplify the decoding 
algorithm. We assume the channel model to be as follows: Y = SH + W where, H denotes 
the n X m channel matrix, S denotes the transmitted 2n x n codeword, Y denotes the 2n x m 
received matrix and W denotes the 2n x m matrix with entries being a sample of zero mean 
complex Gaussian random variables with unit variance. We assume that the GLRT receiver is 
employed for detection at the receiver. For a unitary codebook, the GLRT receiver detects the 
codeword as follows [7], 

^= max Tr [Y^ SiSfY] . (1) 

2 = 1, ...,L 

Let us simplify the above GLRT metric for codes of the specific form proposed. For our 

case S, = ^" for some C, e ^. We have Tr \Y^SiS^Y] = Tr \YY^SiS^] . Moreover 
C 



SiSi 



H 



In 








c. 


c.cf" 




Ci In 



where, the second equality is due to our unitary matrix 



assumption. Let us partition the received matrix Y into sub-matrices Yi and Y2 as follows: 



Y 



Yo 



. Let Yi denote the part of Y corresponding to the transmission of /„ (pilot part) 



and Y2 denote the other part due to the encoded message. Then, we have Tr [FF^S'jS'/^] = 
Tr \YiY-^ + Y2Y^] + Tr [YiY^^d + Y2Yl^C^] . The term Tr [YiY-^^ + YiY^] does not depend 
on Cj and hence can be ignored for decision purposes. Recall that Cj was obtained by substi- 
tuting for real variables xi,. . . ,xk in a linear design D{xi, . . . ,xk)- Let Cj = J2f=i^]^j 
where x\,X2, . . . ,x]^ denote the specific values corresponding to Cj taken by the variables 
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xi,X2, . . . ,xk- Decoding to Ci is thus same as decoding to the values taken by the set of 
variables or in other words decoding to the information vector X. Then the GLRT decoder can 
be rewritten as follows. 



It is clear that if is (/-group encodable, then the maximization in (O can be broken up into g 
individual maximizations each of which is over only a subset of the K variables since the real 
variables in a group takes values independently of the real variables in the other groups. Then 
the real variables in each group can be decoded independently of the real variables in the other 
groups. We refer to such codes as multi-group decodable codes. Note that the above decoder is 
very general in nature and also explains in a simple way how single complex symbol decoding 
can be done for the 2 x 2 non-coherent orthogonal design proposed in [1]. 



In this section, a novel training based approach to distributed non-coherent space-time coding 
for achieving cooperative diversity in wireless relay networks is proposed. 

Consider a wireless relay network as shown in Fig. [T] with a source terminal, a destination 
terminal and R relay nodes. We assume all the nodes in the network to be equipped only with 
single antennas. The fading gain of the channel between any two terminals is modeled by a 
zero mean complex Gaussian random variable with unit variance. The additive noise at all the 
terminals is modeled as a zero mean complex Gaussian random variable with unit variance. Let 
fi denote the channel fade coefficient between the source and the i-th relay and let gj denote 
the channel fade coefficient between the j-th relay and the destination. All the terminals are 
assumed to be symbol synchronized and half-duplex constrained. In this setting, we propose a 
training based distributed space-time coding strategy using which all the terminals can operate 
without the knowledge of any of the channel fading coefficients. It is important to note that 
though pilot signals are used in this strategy, none of the terminals are required to estimate the 
channel fade coefficients. 

The transmission from source to destination consists of two stages. Each stage consists of two 
phases - a pilot phase and a communication phase. The first stage consists of Ti + 1 channel 
uses. During the pilot phase of the first stage, the source transmits the complex number 1 to 
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all the R relays using a fraction tti of the total power (sum of the power used by the source 
and all the R relays) denoted by P. Then the received symbol at the i-th relay denoted by rf 
is given by rf = \J'i\\Pji + where, rij represents the additive noise at the 2-th relay. During 
the communication phase of the first stage, the source transmits a vector s of size Ti x 1 taken 
from a codebook C satisfying E {s^s} = T\ using a fraction tx\ of the total power P to all the 
R relays. This vector s actually carries the message intended to be communicated by the source 
to the destination. Thus the received vector during the communication phase at the i-th relay 
denoted by r| is given by r| = ^JixxPjiS + Vi where, Vi represents the additive noise vector at 
the z-th relay. 

During the second stage, the relays linearly process the received signals from the source 
(which contains the pilots and the message) and relay the information to the destination. The 
pilot phase of the second stage consists of R channel uses. Of these R channel uses, one of 
them is allocated to each one of the R relays for transmission. During its scheduled transmission 
slot, the z-th relay transmits a scaled version of rf using a fraction 1x2 of the total power P. The 
symbol transmitted by the i-th relay is given by t\ = yj~^^r\ = \l^^:fi + yj~^^ni. 
In the communication phase of the second stage, all the R relays transmit together a linearly 
transformed version of rf or its conjugate rf using a fraction 112 of the total power P. For this 
purpose, each relay is equipped with a complex matrix Bi of size T2 x Ti, which we call the 
'relay matrix' that satisfies || Bi |||.< T2. The duration of the communication phase in the second 
stage is thus T2 channel uses. To be precise, the vector transmitted by the i-th relay denoted as 
t| is given by t| = \J~^^Birl = ^J^^^fiBiS + .J^^BiVi where the notation x denotes 
either x or x* according to the context. The four phases in the entire transmission protocol are 
pictorially depicted in Fig. [21 The power allocation factors tti and 112 have to be chosen so as to 
satisfy iviPTi + n2PRT2 = P(Ti + T2). Throughout this letter, we choose tti = 1 and 112 = j^. 
In the proposed transmission protocol, the destination is scheduled to receive signals only during 
the second stage. Let yi and 2/2 denote the received vector at the destination during the pilot 
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phase and communication phase respectively of the second stage. Then, we have 



R 



yi 



Ui 



i=l 



niP 



h9i 

hg2 

fRdR 



+ 



g2n2 
QrUr 



+ Ui 



where, the vector ui represents the additive receiver noise at the destination during the pilot 
phase of the second stage. Similarly, we have 

fi9i 



y2 



711712 



p2 



TViP 



Bi~s B2S 



Brs 



f292 



IrOr 



+ 



TT2P 



R 



TViP 



Vi]+U2 



i=l 



where, the vector U2 represents the additive receiver noise at the destination during the commu- 

1 T 



nication phase of the second stage. Let wi 

W2 



TTlP+l 



giui g2n2 



QRnR 



ui and 



U2 which denote the equivalent noise as seen by the destination 
during the pilot and communication phases. Then we have the following signal model for the 
total received vector y at the destination. 

fiQi 

Ir 1292 

Bis B2S . . . Bjis 



y 



yi 
y2 



7ri7r2P2 
7riP+ 1 



fRdR 



W2 



(3) 



Essentially we observe that the signal model becomes identicaljo a linear fading MIMO channel 

Ir 



y= \/^^SH + W where, S 



H 



fl9l f292 ■■■ fR9R 



and W 



Wi 



W2 



Bis B2S . . . Brs 
The difference here as compared to the case of colocated MIMO channels 



is that here the entries of the channel matrix i7 are a product of two Gaussian random variables 
and the entries of the equivalent noise vector W are not complex Gaussian distributed. 

For simplicity, we restrict ourselves to choosing C and the relay matrices Bi, B2, ■ ■ ■ , Br such 



that the set of matrices ^ 



BiS B2S 



Brs 



consists of only unitary matrices. Let 



Y<o\ = \C\ = L and let the elements of be denoted by Ci,C2, ■ ■ ■ ,Cl- Then the distributed 
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non-coherent space-time code consists of L scaled unitary matrices of the form S denoted by 
Si, 5*2, ... , Sl- For such a unitary codebook, imitating the colocated MIMO case we propose to 
use a suboptimal mismatched decoder at the receiver as shown below: 

S= max Tr \Y" S.S^Y] . (4) 

i=l,...,L 

We call this decoder as mismatched because the entries of the equivalent noise vector W are not 
Gaussian distributed. Furthermore, this receiver is suboptimal because conditioned on knowing 
gj,j = 1, . . . , R, the covariance matrix of is a diagonal matrix and not a scaled identity 
matrix. This fact is not exploited by the decoder in (H)) and hence is suboptimal. The following 
theorem states that this suboptimal mismatched decoder already gives full cooperative diversity 
equal to R. 

Theorem 1: If BiB^ are diagonal matrices Vi = 1, . . . , /? and if C^Ci = CiC^ = Ir, = 
1, . . . , L then full diversity equal to R is achieved by the suboptimal mismatched decoder in Q 
if rank {d - Cj) = R for all d ^ Cj G ^. 

Proof: The proof follows on the similar lines as the proofs in [3] and hence omitted. ■ 

can be viewed as a linear 



Observe that the sub-matrix of S given by 
;sign if the vector s is obtained from Ti coi 

set. Then the codewords of a distributed non-coherent space-time code look like Si 



Bis B2S . . . Brs 

design if the vector s is obtained from Ti complex variables which take values from some signal 

Ir 
_ Q _ 

Ci E ^ where ^ is now a linear space-time code. The difference here as compared to the 
colocated MIMO case is that is obtained from a conjugate linear design (a linear design in 
which any column contains complex linear combinations of only the complex variables or only 
their conjugates) as opposed to any arbitrary linear design. In this correspondence we consider 
only such distributed non-coherent space-time codes since they are easier to study and their 
encoding complexity is also less. Moreover, the notion of multi-group decodable codes can then 
be utilized in the distributed setting also. 

Note that in the proposed coding strategy, the channels between all the terminals are assumed 
to be quasi-static for a duration of Ti + T2 + R + 1 channel uses. Of the total Ti + T2 + R + 1 
channel uses, note that R + 1 channel uses are employed for training purposes. Supposing the 
channel coherence interval is much more than Ti + T2 + R + 1 channel uses, then we can stop 
the pilot phases after the first Ti + T2 + R + 1 channel uses and the source can henceforth 
transmit data once very Ti + T2 channel uses. In this work, we let Ti = T2 = R for which the 
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channel coherence interval should be 3R + 1 channel uses. At this juncture we would like to 
point out that distributed differential space-time coding [2], [3], [4] on the contrary demands 
a channel coherence interval of 4i? channel uses but can however enable the source transmit 
once every 2R channel uses always. Also note that the proposed strategy does not demand the 
existence of matrices that commute with the codeword matrices and a carefully chosen initial 
vector which is the case for distributed differential space-time coding [2], [3], [4]. Furthermore 
though pilots have been employed in our transmission strategy, the relays do not estimate the 
fading gains from the source to the relays, but instead simply amplify and forward the pilots to 
the destination. 



A. Explicit Coding 

In this subsection, we construct a class of 2-group decodable fully diverse unitary space-time 
codes derived from PCIODs which can be employed as distributed non-coherent space-time 
codes. These codes can also be used in colocated MIMO systems for application either in the 
differential setup or in the training based setup as described in Section UIl PCIODs were first 
proposed for use as coherent distributed space-time codes in [6]. 

Construction 1: [6] Given an even number R, the rate one, R x R PCIOD Cp is given as 
follows: 



Cf 



diag 



Xl + 1X2 —X3 + 1X4 
x:j + ix4 Xl — ix2 



Xk+lXk+l -Xk+2+l^k+3 
Xk+2+iXk+3 ^k-i^k+i 



X2R-3+^X2R-2 -X2R-1+IX2B. 

X2R~l + iX2R ^2R-3 - iX2R-2 

(5) 



i=2ij-3 ^i) from which we 



There are totally 2R real variables in the linear design Cp. 
We have C^Cp = diag { (Eti xf) h, . . . , (E-=! xf) h, . . 
infer that PCIODs do not lead to unitary codewords for arbitrary signal sets. But this can be ac- 
complished by appropriately choosing multidimensional signal sets such that for all signal points 
Ylt=i^'i = ■■ ■ = Yld=2R-'i^'i ~ 'To obtain full diversity, we first note that |ACp ACp| = 
(X]^=i ^xf)^ . . . (^f=2R-3 ^^f) ' where ACp has been used to denote the difference matrix. 
Thus PCIODs do not offer full diversity for arbitrary signal sets. To get unitary matrices and full 
diversity we propose to choose the multidimensional signal points as follows. Firstly we form R 
complex variables Si, S2, . . . , sr given by: si = Xi + ix2, S2 = X3 + ix^, . . . ,sr = X2R-1 + ix2R. 
Now with this assignment of complex variables note that PCIODs are conjugate linear designs 
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which is a necessary requirement for application in the distributed setting. We group these R 
complex variables into two groups - First group:si, S3, ... , sr-i and Second group: S2, S4, • • • , sr. 
Then si and S2 are allowed to take values independently from a PSK signal set with number 
of points depending on the transmission rate requirement. Then we let the complex variables 

53, S5, . . . , s/j_i to be some rotated versions of the specific value chosen by Si. Similarly we let 

54, se, . . . , Si? to be some rotated versions of the specific value chosen by S2. Thus all the complex 
variables take values which lie on the unit circle. Then with this choice of multidimensional signal 
set, it is easy to check that the resulting codewords are fully diverse and unitary. PSK signal set 
has been employed in order to obtain unitary codewords. It is also clear that the resulting code 
is 2-group encodable and 2-group decodable. 

IV. Simulation Results 

In this section, it is verified by simulations that full cooperative diversity is achieved if the 
rank criterion as stated in Theorem [U is satisfied. The distributed non-coherent space-time code 
employed has been obtained using a unitary space-time code derived from the Alamouti design 
with a QPSK constellation. Thus Ti = T2 = R = 2 and hence the transmission rate of the source 
is I bits per channel use. Fig. [3] shows the error performance of this code under single complex 
symbol decoding as in ^ from which it can be observed that the slope of the codeword error 
rate is almost 2 in the high SNR regime as expected. 
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Fig. 1. A wireless relay network 



1 Ti , , R , , ]i 



Pilot 


Communication 


Pilot 


Communication 


phase 


phase 


phase 


phase 



First stage Second stage 



Fig. 2. Four phase transmission protocol 
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Fig. 3. Error performance of Alamouti design with QPSK constellation employed as a distributed non-coherent space-time 
code in a two relay system 
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